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OF CONTENT ON DEMAND 
Priority Claim 

[0001] The present application claims priority to A METHOD AND SCALABLE 
MECHANISMS FOR EFFICIENT, SCALABLE, AND RELIABLE DELIVERY 
OF INTERACTIVE PROGRAM DATA, filed on November 13, 2003, and 
having application number 60/520,359. 

Technical Field 

[0002] The present disclosure relates to request and delivery of audio and/or 
video content, and content on demand. 

Background Art 

[0003] Several challenges arise when attempting to deliver scalable and 

reliable content-on-demand. Updated configuration information should be 
available for the set top boxes that facilitate ordering, receiving, and 
rendering of the content. Information describing categories of available 
content, and individual titles, should be made available to the set top boxes 
in a scalable fashion. The system should also provide mechanisms to 
facilitate restriction of rated content, and to provide control over navigation of 
on-demand content. 

Disclosure of Invention 

[0004] The following summary is intended to highlight and introduce some 
aspects of the disclosed embodiments, but not to limit the scope of the 
invention. Thereafter, a detailed description of illustrated embodiments is 
presented, which will permit one skilled in the relevant art to make and use 
aspects of the invention. One skilled in the relevant art can obtain a full 
appreciation of aspects of the invention from the subsequent detailed 
description, read together with the figures, and from the claims (which follow 
the detailed description). 

[0005] A content on demand system includes logic to deliver at least one 

audio and/or video stream and to insert markers in the at least one stream, 
the markers indicating a position in the at least one stream, and logic to 
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which the set top box has paused or suspended viewing, and upon a signal 
from the set top box to resume streaming of the audio and/or video stream 
from a position proximate to the marker. 

Brief Description of the Drawings 

[0006] The headings provided herein are for convenience only and do not 
necessarily affect the scope or meaning of the claimed invention. 

[0007] In the drawings, the same reference numbers and acronyms identify 
elements or acts with the same or similar functionality for ease of 
understanding and convenience. To easily identify the discussion of any 
particular element or act, the most significant digit or digits in a reference 
number refer to the figure number in which that element is first introduced. 

[0008] Figure 1 is a block diagram of an embodiment of a system for providing 
content-on-demand. 

[0009] Figure 2 is a block diagram of an embodiment of a virtual channel 
group embodied within a quadrature amplitude modulation frequency 
segment. 

[0010] Figure 3 is a block diagram of an embodiment of a virtual channel 

group, to deliver configuration and content information to a set top box, 
embodied within a quadrature amplitude modulation frequency segment. 

[001 1] Figure 4 is a block diagram of an embodiment of a broadcast 
initialization carousel. 

[0012] Figure 5 is a block diagram of an embodiment of a channel comprising 
a content index and/or a category index. 

[001 3] Figure 6 is a block diagram of an embodiment of a stream including 
markers. 

[0014] Figure 7 is a flow chart of an embodiment of a process of initializing a 

set top box for content on demand. 
[0015] Figure 8 is a flow chart of an embodiment of a set top box displaying 

information about available content. 
[001 6] Figure 9 is a flow chart of an embodiment of a process of monitoring 

markers in a stream. 
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Information into a format of audio and/or video streams. 

Industrial Applicability and Modes for Carrying Out the Invention 
[0018] . The invention will now be described with respect to various 

embodiments. The following description provides specific details for a 
thorough understanding of, and enabling description for, these embodiments 
of the invention. However, one skilled in the art will understand that the 
invention may be practiced without these details. In other instances, well 
known structures and functions have not been shown or described in detail 
to avoid unnecessarily obscuring the description of the embodiments of the 
invention. References to "one embodiment" or "an embodiment" do not 
necessarily refer to the same embodiment, although they may. 
[0019] Herein, the term "logic" refers to software (e.g. program instructions) 

and/or hardware (e.g. circuitry) to carry out processing in a device, such as a 
computing device, routing/switching device, communication device, or data 
processing device. 

[0020] Figure 1 is a block diagram of an embodiment of a system for providing 
audio and/or video streams on demand. The content-on-demand server 
system 102 comprises a content server 129 and a metadata server 125. The 
content-on-demand server system 102 further comprises a resource 
manager 127, a content manager 123, and a billing/provisioning function 
121. 

[0021] The system 102 communicates, via a network structure 104, audio 

and/or video formatted streams to set-top boxes 106, 108, 109. The system 
102 and the network structure 104 may be interfaced using Ethernet 
networking. The streams may comprise on-demand content, such as movies 
or music. The streams may also comprise configuration and content 
metadata. 

[0022] The distribution media 111 provides a shared communication media 

(such as coaxial cable) for communication to and from the plurality of set-top 

boxes 106. 108, 109. 
[0023] The server system 102 may comprise logic to compose set top box 

configuration information into an audio and/or video stream format (e.g. 

MPEG-2). Set top box configuration information may include general 
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and IP address and port numbers for communicating with the server system 
1 02), and configuration information that is specific to one or more groups of 
set top boxes (e.g. local program options). 

[0024] The server system 102, for example the content manager 123, may 
also comprise logic to compose information about multiple audio and/or 
video streams (content metadata) into an audio and/or video stream format. 
Content metadata may include information about content categories (e.g. 
horror, documentary, comedy, etc.), and information about audio and/or 
video titles (e.g. movie titles, movie length, movie description, leading actors, 
etc., as well as ads, logos, and other graphics.). 

[0025] The server system 102 may further comprise logic to compose an 
index of content metadata into an audio and/or video stream, format. The 
index of content metadata may comprise indications of which channels a set 
top box should tune in order to access content metadata, and locations of 
the data within those channels. The metadata server 125 receives and 
responds to requests from the set-top boxes 106, 108, 109 for content 
streams. The metadata server 125 communicates to the content server 129 
information about what to stream to the set-top boxes 106, 108, 109. For 
example, the metadata server 125 may identify a movie or music file to 
stream to one or more of the set top boxes 106, 108, 109 in fulfillment of an 
on-demand content request. 

[0026] The resource manager 127 provides resource allocation and load 

balancing for the content-on-demand system 102. The content manager 123 
stores information about content offerings, such as category information and 
information descriptive of content titles (collectively referred to as content 
metadata). The metadata server 125 may format the content metadata as 
one or more audio and/or video stream files (e.g. MPEG-2 or another audio 
and/or video stream format) and p i o v ide the files, along with scheduling 
information, to the content server 129. 
[0027] The billing/provisioning function 121 provides customer account 

management, provisioning, and billing. The billing/provisioning function 121 
may be notified of content requests received by the metadata server 125. 
[0028] In one embodiment the network structure 104 comprises, among other 
things, one or more user datagram protocol/internet protocol (UDP/IP) 

-4- 



SUBSTITUTE SHEET (RULE 26) 



WO 2005/050993 



PCT/US2004/037619 



flStem? fl* to. Spnno be 
supported. The network structure 104 may also comprise one or more 
service nodes 119. The UDP/IP gateway 113 provides bi-directional 
communication for orders, logic updates, and other purposes between set 
top boxes 106, 108, 109 and the content-on-demand server system 102. 
The UDP/IP gateway 113 converts between UDP/IP communications, used 
by the content-on-demand system 102, and quadrature phase-shift keying 
(QPSK) radio frequency communications, used by the set top boxes 1 06, 
108, 109. Communication between the metadata server 125 and a set top 
box such as 106, 108, or 109 is always point-to-point, with the particular set 
top box such as 106, 108, or 109 being specifically addressed. 
[0029] The network structure 104 facilitates delivery of content-on-demand 
between the content-on-demand server system 102 and various set top 
boxes 106, 108, 109. Service nodes 1 19 may provide conversion of digital 
streams to radio frequency (RF) quadrature amplitude modulation (QAM) 
communication used by cable television systems. The service node 119 may 
include logic to insert, in the QAM/RF communication from the server system 
102 to a set top box, a service group identification (not shown) for the set top 
boxes 106, 108, 109. 

[0030] The service node 119 receives the audio/video stream in a format such 
as MPEG2 over Ethernet. The sen/ice node 1 1 9 delivers the audio/video 
stream to the medium used for the set top boxes 106, 108, 109 in a format 
such as MPEG2 over radio frequency communications. Communication of 
content-on-demand is provided using broadcast and is one way. 

[0031] Figure 2 is a block diagram of an embodiment of a virtual channel 

group embodied within a quadrature amplitude modulation (QAM) frequency 
segment. The spectrum of the QAM 202 is allocated to a virtual channel 
group (VCN) VCN 1 - VCN N. Each virtual channel within the group is 
allocated a portion of the frequency spectrum of the QAM 202. 

[0032] A set top box can tune to a particular channel in order to access 

information of the channel. A channel may comprise an audio and/or video 
stream, configuration information for the set top box, and/or content 
metadata or an index thereof. 

[0033] Figure 3 is a block diagram of an embodiment of a virtual channel 
group, to deliver configuration and content information to a set top box, 
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Set'top box configuration information is provided via an "initialization 
carousel" (IC) 333* on a predetermined channel. 

0034] Figure 4 is a block diagram of an embodiment of set top box 

configuration information, such as the IC 333 in Figure 3. The configuration 
information, which is typically provided by the content-on-demand server 
system 102, includes global configuration information 402 that applies to all 
set top boxes, and settings 403-406 for particular service groups. In one 
embodiment, the content-on-demand server system 102 may compose set 
top box configuration information expressed in Extensible Markup Language 
into an audio and/or video stream format. 

0035] A set top box may include logic to tune to a channel comprising the 

configuration information in streamed audio and/or video format, to read the 
configuration information, and to store the configuration information in 
memory. The set top box logic may check a version of the configuration 
information, and when the version does not match a version of configuration 
information stored in memory, the set top box may store the configuration 
information to memory. In other words, the set top box may replace obsolete 
configuration information in memory with more up to data configuration 
memory from the continuously-streaming initialization carousel. 

[0036] A set top box may include logic to locate a service group identifier in 

the QAM 321 , and to apply the service group identifier to locate configuration 
information 403-406 specific to set top boxes belonging to the service group. 

[0037] Content metadata and an index thereof may be provided via metadata 
channels (MMC) 323-325. In one embodiment, the content-on-demand 
server system 102 may compose content metadata and/or an index thereof 
expressed in Extensible Markup Language into an audio and/or video 
stream format (one or more streams). 

[0038] Alternative metadata channels 327-329 (triggers) may provide an alternative 
source of the metadata and/or index. 

[0039] To receive on-demand content, a set top box may tune to a channel 
333 comprising set top box configuration information having a streamed 
audio and/or video format. The set top box may locate among the 
configuration information a service group identifier for the set top box, and a 
channel 325 and an alternate channel 329 from which to retrieve a content 
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The set top box may then tune to the content index channel 325 and retrieve 
the content index. When the channel from which to retrieve the content index 
comprises an indication of invalid data (e.g. a predetermined bit pattern 
indicating invalid data in the channel), the set top box may tune to the 
alternate channel 329 for the content index. In this manner, the content 
index may be continually streamed via the channel 325, and when the data 
in the index becomes obsolete, the information of the channel 325 may be 
set to indicate invalid data. The set top box then tunes to the alternate 
channel 329 for the content index. 

[0040] QAM virtual channels (323 - 329, 331 , 333) can also include one or 
more video barkers 331. 

[0041] Figure 5 is a block diagram of an embodiment of a metadata channel 
502 comprising a content index 505. The content index 505 may comprise 
an index of content category metadata, and an index of "package" metadata, 
e.g. metadata for individual content titles. This index 505 may be applied by 
the set top box to generate a visual guide and/or menu of available content. 

[0042] In one embodiment the channel 502 may be organized into MPEG 

packet identifiers (PIDs), and the PIDs themselves organized into sections or 
segments. The channel 502 may include audio and video-type information 
(507 and 506, respectively) to facilitate tuning by the set top box. The 
channel 502 may include one or more advertisements and logos 504 to 
display with the content guide that is presented to users of the set top box. 

[0043] In one embodiment, the channel 502 may also comprise content 

category metadata 509, 511, 512, 514, 515. Content category metadata 509, 
511, 512, 514, 515 comprises information about categories of content such 
as adult, science fiction, horror, documentary, and so on. 

[0044] A set top box may locate in the content index a channel comprising 

audio and/or video title information (e.g. a content title metadata such as 323 
or 324 in Figure 3), and tune to the channel comprising the title metadata. 
The set top box may locate the audio and/or video title information in the 
channel, and apply at least a portion of the title metadata and the service 
group identifier in a request for content communicated to the server system. 
The server system may respond by communicating to the set top box a 
stream comprising the requested title. 
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comprises an indication of invalid data (e.g. a trigger), the set top box may 
tune to an alternate channel (e.g. channels 327 or 328 in Figure 3) 
comprising the title metadata. In this manner, the title metadata may be 
continually streamed via the channels 323-324, and when the title metadata 
becomes obsolete, the information of the channel 323 or 324 may be set to 
indicate invalid data, to cause the set top box to tune to the alternate 
channel comprising up-to-date title metadata. 
[0046] Thus, the content-on-demand server system may include logic to 

alternatively deliver content metadata on at least one first tunable channel 
and on at least one alternate tunable channel. Likewise, the server system 
may include logic to alternatively deliver an index of content metadata on a 
first tunable channel and on an alternate tunable channel. In this manner, 
updates to the content metadata and index may be accomplished without 
interruption or loss of information. 
[0047] In addition, to provide control over navigation and presentation of 

streamed content, the server system may include logic to insert markers in 
communicated audio and/or video streams. The meaning of the markers 
may vary. Some markers may indicate a position in the stream. Some 
markers may indicate a rating of content of the stream proximate to a 
position of the marker, typically following the marker. 
[0048] Figure 6 is a block diagram of an embodiment of an audio and/or video 
stream 602 including markers 604-607. A predetermined number of markers 
may be inserted in each time interval of the stream. This number may be 
provided with the set top box configuration information. 
[0049] One example of the use of stream markers is for pausing and resuming 
delivery of on-demand content. A set top box may communicate to the 
content-on-demand server system a marker for an audio and/or video 
stream for which the set top box has suspended viewing. The server system 
may then suspend delivery of the stream. Upon a later signal from the set 
top box, the server system may resume streaming of the audio and/or video 
stream from a position proximate to the marker. 
[0050] Some markers may indicate the proximity of advertisements in a 

stream. Logic of the set top box and/or server system may prevent certain 
navigation functions in the vicinity of advertisements, such as fast-forwarding 

-8- 



SUBSTITUTE SHEET (RULE 26) 



WO 2005/050993 



PCT/US2004/037619 



pM^at&emmmts* Tbwmill»&& mWGffllffipf&h&m whereby 
set-top boxes my detect the end of an audio and/or video stream, and/or 
loss of stream content. 

[0051] Some markers may indicate the proximity of rated content. Logic of the 
set top box and/or server system may inhibit rendering (e.g. prevent viewing 
or hearing) of the rated content, or warn of the presence of the rated content, 
under certain circumstances, such as when the content has a mature rating. 

[0052] Figure 7 is a flow chart of an embodiment of an initialization of a set top 
box for video-on-demand. At 702 a content-on-demand feature of a set top 
box is initiated. The set top box tunes to the initialization carousel channel at 
704. Global configuration data for the set top box is read from the 
initialization carousel at 706. At 708, if the version of the configuration 
information matches the version of cached information, the process 
concludes at 718. Otherwise, a service group identifier for the set top box is 
extracted from the QAM at 712. Service group-specific information is then 
read from the initialization carousel at 716. 

[0053] Thus, a set top box may include logic to tune to a predetermined virtual 
channel number to retrieve set top box configuration information formatted 
as an audio and/or video stream. 

[0054] The set top box may include logic to apply the configuration information 
to affect operational settings of a set top box, when a version of the 
configuration information differs from a version of configuration information 
previously applied. 

[0055] The set top box may include logic to apply general configuration 

information to affect general operational settings of the set top box, and logic 
to locate a service group identifier for the set top box and to apply the 
service group identifier to locate service group specific configuration 
information for the set top box. The sen/ice group specific configuration 
information may also be formatted as an audio and/or video stream. 

[0056] Figure 8 is a flow chart of an embodiment of a process by which a set 
top box may display information about available audio and/or video streams. 

[0057] At 802 the set top box tunes to a channel comprising a content index, 
the content index formatted as streamed audio and/or video information. The 
index information is cached to a memory of the set top box at 804. At 806 a 
user of the set top box makes a request for content information, for example, 
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At 808 a channel comprising the content metadata is identified from the 
index. The set top box tunes to the channel comprising the content metadata 
at 810. At 812 the requested content information is displayed. 
[0058] Figure 9 is a flow chart of an embodiment of monitoring markers in an 
audio and/or video stream. At 904 the set top box tunes to the channel of the 
audio and/or video stream. If at 906 the tune is successful, the set top 
monitors the stream for markers at 908. Otherwise, the process concludes at 
920. 

[0059] If at 910 there are not sufficient markers in a predetermined time 

interval of the stream, the process concludes and reception and rendering of 
the stream may be terminated. 

[0060] Otherwise, a test is made at 91 2 for certain boundary conditions. 

Boundary conditions can include, among other things, being close to the end 
of the stream, or being close to an advertisement. Navigation is enabled at 
914 if a boundary condition is not detected. A check is also made at 916 for 
rating conditions, such as proximity of mature content. If there is a rating 
condition, rendering of the content is restricted, or a warning provided, at 
918. 

[0061] Thus, a set top box may include logic to scan an audio and/or video 
stream for markers. 

[0062] When an insufficient number of markers are detected within a time 
interval, rendering of the stream may be terminated (for example, by 
signaling the server system 102 to discontinue streaming of the content). 

[0063] When one or more markers indicate a restricting condition, rendering 
and/or navigation of the stream may be inhibited according to the restricting 
condition. 

[0064] Inhibiting navigation of the stream may include inhibiting at least one of 
fast forward, rewind, pausing, skipping, or playing of the stream. Inhibiting 
rendering of the stream may include inhibiting at least one of viewing or 
listening of the stream when the rating is mature content. 

[0065] Rendering and/or navigation of the audio and/or video stream may be 
re-enabled when another marker indicating an end to the restricting 
condition is encountered in the stream. 
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fast forward, rewind, pausing, skipping, or playing of the audio and/or video 
stream. Enabling rendering of the stream may include enabling at least one 
of viewing or listening of the stream when the rating is no longer mature 
content. 

[0067] Figure 1 0 is a block diagram of an embodiment of a manner of 

composing content metadata 1002 into an audio and/or video stream format, 
in this case an MPEG-2 format. 

[0068] The metadata 1002 is divided into smaller objects 1004-1006, and 

each chunk is provided with an MPEG NC object header 1008-1010. Each 
object 1004-1006 is further decomposed into segments, such as for object 
1005 segments 1012-1015. Each segment is provided with an MPEG NC 
segment header 1017-1020. Finally, each segment may be decomposed 
into MPEG transport packets of type DC2Text, MPEG private sections, 
and/or other packet types supported by the set top box. 

[0069] Unless the context clearly requires otherwise, throughout the 

description and the claims, the words "comprise," "comprising," and the like 
are to be construed in an inclusive sense as opposed to an exclusive or 
exhaustive sense; that is to say, in the sense of "including, but not limited 
to." Words using the singular or plural number also include the plural or 
singular number respectively. Additionally, the words "herein ," "above," 
"below" and words of similar import, when used in this application, shall refer 
to this application as a whole and not to any particular portions of this 
application. When the claims use the word "or in reference to a list of two or 
more items, that word covers all of the following interpretations of the word: 
any of the items in the list, all of the items in the list and any combination of 
the items in the list. 
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Claims 

What is claimed is: 

1 . A content on demand system comprising: 

logic to deliver at least one audio and/or video stream and to insert markers in the at 
least one stream, the markers indicating a position in the at least one stream; and 
logic to receive from a set top box a marker for an audio and/or video stream for 
which the set top box has paused or suspended viewing, and upon a signal from the 
set top box to resume streaming of the audio and/or video stream from a position 
proximate to the marker. 

2. A content on demand system comprising: 

logic to deliver at feast one audio and/or video stream and to insert markers in the at 
least one stream, the markers indicating a rating of content of the stream proximate 
to a position of the marker. 

3. A content on demand system comprising: 

logic to deliver at least one audio and/or video stream and to insert markers in the at 
least one stream, the markers indicating a proximity to advertisements in the at least 
one stream. 

4. A set top box comprising: 

logic to scan an audio and/or video stream for markers; and 

when one or more markers indicate a restricting condition, inhibiting rendering and/or 
navigation of the audio and/or video stream according to the restricting condition. 

5. The set top box of claim 4, wherein the restricting condition further comprises: 
proximity of an end of the audio and/or video stream. 
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6. The set top box of claim 4, wherein the restricting condition further comprises: 
proximity of an advertisement in the audio and/or video stream. 

7. The set top box of claim 4, wherein the restricting condition further comprises: 
a rating of content of the audio and/or video stream. 

8. The set top box of claim 4 f wherein the logic to inhibit rendering and/or navigation 
of the audio and/or video stream further comprises: 

logic to inhibit at least one of fast forward, rewind, pausing, skipping, or playing of the 
audio and/or video stream. 

9. The set top box of claim 4, wherein logic to inhibit rendering and/or navigation of 
the audio and/or video stream further comprises: 

logic to inhibit at least one of viewing or listening of the audio and/or video stream 
when the rating is mature content. 

10. The set top box of claim 4, further comprising: 

logic to terminate rendering of the audio and/or video stream when an insufficient 
number of markers are detected within a time interval. 

1 1 . The set top box of claim 10, further comprising: 

logic to enable rendering and/or navigation of the audio and/or video stream when a 
marker indicating an end to the restricting condition is encountered in the audio 
and/or video stream. 

12. The set top box of claim 11, wherein the restricting condition further comprises: 
proximity of an end of the audio and/or video stream. 
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1 3. The set top box of claim 1 1 , wherein the restricting condition further comprises: 
proximity of an advertisement in the audio and/or video stream. 

14. The set top box of claim 1 1 f wherein the restricting condition further comprises: 
a rating of content of the audio and/or video stream. 

1 5. The set top box of claim 1 1 , wherein the logic to enable rendering and/or 
navigation of the audio and/or video stream further comprises: 

the logic to enable at least one of fast forward, rewind, pausing, skipping, or playing 
of the audio and/or video stream. 

16. The set top box of claim 1 1 , wherein the logic to enable rendering and/or 
navigation of the audio and/or video stream further comprises: 

the logic to enable at least one of viewing or listening of the audio and/or video 
stream when the rating is no longer mature content. 
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